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The recent availability of large databases allows to study macroscopic properties of many complex 
systems. However, inferring a model from a fit of empirical data without any knowledge of the 
dynamics might lead to erroneous interpretations [6]. We illustrate this in the case of human 
mobility m and foraging human patterns [2] where empirical long-tailed distributions of jump 
sizes have been associated to scale-free super-diffusive random walks called Levy flights [5]. Here, 
we introduce a new class of accelerated random walks where the velocity changes due to acceleration 
kicks at random times, which combined with a peaked distribution of travel times [7] , displays a jump 
length distribution that could easily be misinterpreted as a truncated power law, but that is not 
governed by large fluctuations. This stochastic model allows us to explain empirical observations 
about the movements of 780,000 private vehicles in Italy, and more generally, to get a deeper 
quantitative understanding of human mobility. 


With the development of Information and Communication Technologies [32: the investigations’ focus shifted from 
the traditional travel diary surveys [33U35] to several new datasources. In particular, it became possible to follow 
individual trajectories from mobile phone calls GUISE location-sharing services [57]-i5ij| and microblogging [451 , 
or directly extracted from public transport ticketing system [2B] [55] or GPS tracks of taxis [m EH H3-H1], private 
cars DUS] or single individuals I3G3- For most data sources, the spatial positions r are the most reliable quantity. 
This information can be used for studying two different aspects of human mobility: (i) how far and (ii) where we 
are moving. The second question is far more complex than the first and can be approached with several different 
methodologies, from aggregated origin-destination matrices [45] suitable for mobility prediction |46j or land use 
analysis m, to individual mobility networks and patterns suitable for describing the natural tendency to return 
frequently to a few locations (such as homes, offices, etc) [51 151 1151 H51 1451 135] . On the other hand, the first question, 
generally characterized by the distribution of the spatial displacements P(Ar) across all users, although apparently 
simple is still far from being completely understood. Indeed, even if the study of P(Ar) has become a trademark for 
recent studies on human mobility, there are still no consensus about the functional form of this distribution. At a large 
scale (national or inter-urban), one may observe a long tail behavior .1] [21 HI EH S7j [35] 4EL ill 331 characterized by 
a power law decay for long displacements. At a smaller (urban) scale, the distribution seems to have an exponential 
tail PE5HI3 EH [Ml EH S3]. The unclear nature of this probability distribution makes its interpretation difficult and 
dependent on the dataset used, the scale and possible empirical errors (see Table [T]). It is therefore necessary to to 
obtain data as clean as possible, to propose a model that can be tested against empirical results. So far power law fits 
were used and led the authors to draw conclusions about the nature and mechanisms of the mobility, but this way of 
proceeding could actually lead to erroneous conclusions. 

Human travelling behaviour can be described as a sequence of resting times of duration r and instantaneous jumps 
A?’ in space [S]. These two processes need to be separated for modelling human mobility, since costs are in general 
associated to trips while a positive utility can be associated to activities performed during stops [E This approach 
is therefore fundamental for a large variety of complex processes in cities where mobility is central, such as traffic 
forecasting [§j, epidemics spreading mm or the evolution of cities GME 

The distribution of displacements P(Ar) plays a central role and several studies suggested the existence a fat tails 
(power law distribuitons) with a wide range of values for the exponent (see Table [I]). In contrast, at the urban scale, 
displacements seem to be described by exponential tails [MIS. A similar, unresolved controversy also exists about 
animal’s foraging movements: relying only on a fit of the empirical data, the same distribution can be understood in 
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different ways, leading to contrasting conclusions on the nature of the underlying process |18H20j . Remarkably enough, 
what appears to be underevaluated in the study of human mobility is the relevance of travel itself. The proposed models 
usually neglect the role of travel time and the moving velocity, also those indicating the hierarchy of transportation 
networks as the cause of this long tail [2Tj, assuming instantaneous jumps. This is essentially a consequence of 
the limitations inherent to datasources. Phone calls or social networks capture the spatial character of individuals’ 
movements 1221 , but they are limited by sampling rates or by the bursty nature of human communications |23] and 
are thus not suitable for an exhaustive temporal description of human mobility. 


Data Source 

Trajectories 

P 

K, 

Ar 0 

Dollar Bills [1] 

464K 

1.59 

OO 

0 

Mobile Phones [2] 

100K 

1.75 

400 km 

1.5 km 

Mobile Phones [3] 

206 

1.75 

80 km 

1.5 km 

Mobile Phones [8] 

3M 

1.55 

100 km 

0 

Location Sharing [37] 

220K 

1.88 

OO 

0 

GPS tracks [3] 

101 

[1.16,1.82] 

oo 

0 

Location Sharing [5S] 

900K 

1.50 

oo 

2.87 km 

Location Sharing [38] 

900K 

4.67 

oo 

18.42 km 

Taxis [17] 

12K 

0 

4.29 km 

- 

Taxis [4T] 

7K 

1.2 

10 km 

0.31 km 

Mobile Phones .'10 

7K 

0 

[2, 5.8] km 

- 

Travel Diaries [34] 

230 

1.05 

50 km 

0 

Tweets [40 

13M 

1.62 

OO 

0 

Location Sharing [35] 

521K 

0 

300 km 

- 

Taxis [42] 

30K 

0 

[2, 4.6] km 

- 

Taxis [44] 

1100 

[0.50,1.17] 

[4.5, 6.5] km 

0 


TABLE I: Parameter values for the fit of the displacement distribution with a truncated power law found in previous studies. 
This list includes studies on different datasources and spatial or temporal scales. Only fits consistent with the function 
P(Ar) = (A r + Aro) _/3 exp(— —) are presented here. The case k = oo is associated to non-truncated power laws, while /3 = 0 
to exponential distributions. Aro = 0 correspond to cases where this parameter was omitted in the fit, while ft = 0 when this 
value is not defined. Further studies propose: i) a polynomial form close to an exponential behavior for Private Cars El ; ii) 
two different behaviours for urban and inter-urban trajectories for Cars and Taxis EE; iii) a lognormal distribution for 
individual GPS tracks |25| . 


In this paper, we show that the observed truncated power laws in the jump size distribution can be the consequence 
of simple processes such as random walks with random velocities [53] . We test this model over a large GPS database 
describing the mobility of 780,000 private vehicles in Italy, where travels and pauses can be easily separated, as the 
transition is identified by the moment when the engine is turned on or off (but we introduce a lower threshold of 5 
minutes in the elapsed time, to distinguish real stops from accidentaly switched off of the engine during a trip). This 
allows us to evaluate accurately not only the displacements A?’, but also travel-times t, speeds v, and rest times r. 
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FIG. 1: Travel time in the 6 largest Italian cities. The probability distribution is an exponential function and the 
difference between cities is therefore fully encoded in a single parameter: the average travel-time t. 


We first analyze the distribution P(t) of the travel times of individual vehicle in different cities. We observe in 
Fig. [I] that P{t) is characterized by an exponential decay as in [71125]. For public transport systems, we also observe a 
rapidly decreasing tail for the travel-times between metro stations in London evaluated from the Oyster card ticketing 


system (see [26] and Fig. SI left). A short tail is expected for these distributions, since the total daily travel-time 
spent in public m or private [7] transportation has an exponential tail. 

The individual mobility is composed by essentially two different phases, travels and rests, both having a finite 
duration. Studying times instead of distances allows to exclude the variability due to the different speeds, which may 
contribute to the perceived difference in travel costs. When travel costs are homogeneous, their distribution is expected 
to be exponential [34]. Similarly to what is observed for total daily travel-times [2 nni [2z], the exponential distribution 
appears in a variety of context such as the difference between independent events or maximizing the entropy at fixed 
average. We observe that the distribution of cars’ travel-times is indeed characterized by an exponential tail (Fig. [l]). 

For public transport systems, we also observe a rapidly decreasing tail for the travel-times between metro stations 
in London evaluated from the oyster card ticketing system. (See [25j and Fig. |S1| left). Travel times for cars however 
depend on the city population [14] : the largest the city, and the worse congestion effects. This is confirmed by our 
results: if we bin the drivers according to their city of residence (see Methods), we find that the average travel¬ 
time in different cities falls in the range [9 min, 18 min], with a growth correlated with population ( t ) oc P^ 1 , where 
H = 0.07 ± 0.02 (Fig. S2). As a consequence of this, the average travel time differs from a city to another one (Fig. SI 
right), but if we rescale the time by its average value, we observe in Fig. [l] that the distribution P(t/t ) appears to be 
universal across different cities and given by 


P{t) = =e~ trt 


(1) 


The average value t contains then all the information needed for describing cars’ travel times in a particular city. 


Rest times 

The distribution of rest times P(r) does not display remarkable differences between cities for stops within 24 hours 
(see Fig. [2] left). Consistently with the delay time distribution in email communications H3[5U], this distribution is 
close to a power law with exponent ~ — 1 [16j (for times shorter than 12 hours). A lognormal fit for the tail seems 
also reasonable m- In Fig. fright, we show the best fits with these functions, while we also propose a fit with a 
stretched exponential that, although it represent a good fit for all times between 5 minutes and 30 days, lacks any 
theoretical explanation. 
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FIG. 2: Pause times distribution of private cars. (Left) The distribution shows little variation among the six largest 
cities in Italy. (Right) Considering all the individuals across the whole Italy, we show that the distribution appear to follow 
a power law P(r) oc t~ v with 77 = 1.03 ± 0.01, valid for pauses shorter than 4 hours (yellow circles), which represents 74% 
of the pauses. Conversely, a lognormal distribution lnA/^/x, a) with /x = 1.6 ± 0.6 h and a = 1.6 ± 0.1 h could be proposed 
to characterize the tail for pauses longer than 1 hour (orange squares). A stretched exponential: P(r) oc exp(— (t/to)^) with 
(3 = 0.19 ± 0.01 and to = 10 ± 5 ■ 1CP 5 h successfully fits the whole curve at all order of magnitude (red solid line). 


Dependence of velocities over travel-time 



FIG. 3: Acceleration of private transportation. Empirical average velocities versus the duration of the trip for our GPS 
dataset. The solid line represents a constant acceleration v = Vo + at with vo = 17.9 km/h and a = 16.7 km/h 2 . 

In order to describe the evolution of velocities, we consider a very general one-dimensional example of randomly 
accelerated walk, with uncorrelated accelerations 

v = VOW) (2) 

where £ is the white noise and 6 is time. This process yields to a Gaussian-like distribution of velocity and which leads 
for its absolute value P(\v\,6) oc exp 2 I 50 ) ■ Combining this result with the exponential distribution of travel-times 
P(t) ~ e _t /t ; an d using a saddle-point approximation, we obtain for large displacements a stretched exponential of 
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the form 


P(Ar) oc Ar 7 exp(— CAr s ) 


(3) 


with 7 = 1/4 and S = 1/2. We show in Fig. 16] the best-fit with this curve, indicated as “Stretched Exponential”. This 
fit with a single parameter (C = 0.63 km -0 "’) would already offer a reasonable description of an empirical distribution 


that is also fitted by a truncated power law with 3 parameters [2] (See also Fig. S 6 ). In addition, the simple random 
model for velocities described by Eq. ([2]) induces a relation [28] between the travel-time t and the average speed (|u|) 
of the form (|i;|) oc t 1 / 2 , which as we will show, is not consistent with empirical data. Similarly to what was proposed 
for the foraging movements of animals and humans, we are mixing here short and slow with long and fast jumps, but 
instead of alternating between behavioral phases where the trajectories have different speed and autocorrelations [29] . 
we have here a continuous distribution of velocities due to the structure of the mobility network. However, for both 
private cars (Fig. [3]) and public transport (Fig. S3 left), we find that average velocities grow linearly for large travel 
times, and not as a square root. We show here that this apparent constant acceleration results from an optimal use 
for longer trips of the hierarchical nature of the transportation networks. Indeed, it is likely that faster transportation 
modes or faster roads are used more frequently for longer trajectories |f 6 ], 30 . In the following, we will propose a 
simple stochastic phenomenon based on this idea. 


Randomly accelerated walkers 



FIG. 4: Schematic representation of the model. The trajectory starts with an acceleration phase, where the velocity is 
increased by constant kicks equal to 5v. These kicks happen at random times with an uniform probability per unit time p. 
When approaching to the destination, there is a deceleration phase with kicks of -5v and the same probability p. Given the 
symmetry of the problem, the maximal speed v* is reached on average at £ m = t/2. This maximal speed depends on 5v and 
p: v m = Vo + k(tmp)5v, where k( A) follows a Poisson distribution of average A. The average speed v of the trajectory can be 
evaluated by approximating the step function which defines the light blue area, as v = (no + v m )/2. 


In this simple model, we assume that the transportation network has n layers L n , corresponding to different travel 
speeds v n (see the schematic representation in Fig. [4]). For the sake of simplicity, we assume that the velocity 
differences between layers are constant and equal to 6v. An individual starts his trip of duration t in Lq, with base 
speed vq, while on layer L he is traveling faster at speed Vk = Vq + k5v. We assume that a trip is composed of two 
phases. In the first one, the trajectory progressively jumps from slower to faster transportation modes. In the second 
phase, there is the inverse process where the trajectory progressively jumps down the layers until it finally reaches the 
base layer at time t. We also assume that the process has a Poissonian character, where all individuals have the same 
probability per unit time p to jump to the successive layer and to change their velocity (both in the ascending and 
descending phases). Together with the last assumption that on average the duration of both phases are equal and 
neglecting any saturation issue (see Methods), we can: i) estimate the maximal speed v m = vq + k(t m )Sv, where k(t m ) 
is the number of jumps at mid trajectory t m = t/ 2 ; ii) approximate the average speed v = 1/t f Q v(t)dv = (to +v m )/2 
(see Fig. [4]). Since the process is Poissonian, ( k(t m )) = pt m and the average speed is then given by 

(v(t)) =v 0 + ^-t (4) 

where the brackets denote the average over the Poisson variable k. The average speed thus grows linearly with t before 
reaching the saturation imposed by the finite number of layers. Remarkably enough, this simple model allows us also 
to predict the shape of the conditional probability distribution P(v\t). Indeed, the number of jumps k is distributed 
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following the Poisson distribution P(k) = e fc , A with A = pt/2. Using the Gamma function as the natural analytic 
continuation of the factorial k\ = T(1 + k), we obtain the distribution 


P{v\t) 


1 exp (-p't + ^ log (p't)) 

6v' r(i + ^) 


(5) 


where p' = pf 2 and Sv' = Sv/2. In Fig[5]we show that the empirical distribution of the velocities at fixed time P(v\t) 
for travel-times ranging between 5 and 180 minutes are consistent with Eq. (©) with t = 0.30h, Sv' = 20.9 Km/h, 
p' = 1.06 jumps/h and vq = 17.9 km/h. This makes the velocity gaps 6v ss 40 km/h, consistently with the progression 
of the most common speed limits in Italy: 50 km/h (urban), 90 km/h (extra-urban), 130 km/h (highways). This result 
suggests that a multilayer hierarchical transportation infrastructure can explain the constant acceleration observed in 
both public and private transportation. This model also allows to estimate the base speed Vq as the intercept value 
in Fig. |3] (b) and (d), and the acceleration a is expected to be proportional to the probability of jump to faster layers 
p and to the gap between layers Sv (see Methods). 



FIG. 5: Conditional distribution P(v\t) of average velocities at different travel times. We fit the curve P(v\t) in the 
interval [uo, u ma x], with vo = 17.9 km/h and Umax = 130 km/h. We fit simultaneously all the curves for t = 5, 10,15, 20,..., 180 
minutes, and we show here six examples represented by blue dots. The best fit curve, represented in red, is given by Eq. © 
with p' = 1.06 jumps/h and Sv 1 = 20.9 km/h. 


Displacement distribution 

Finally, the shape of the displacement distribution P(Ar) can be computed as a superimposition of Poisson distri¬ 
butions (see Fig. fright) 

/■OO rV 2 

P(Ar) = / dt dvS(Ar — vt)P(t)P(v\t) 

J 0 J v i 

_ r°° dt ex p (- y +1) *+ iQ g(p'*) Ar £r° ) 

-Jo tt Avt(i+ Ar/ s t J V0 ) 

where <5 is the Dirac delta function. The exact form for Eq. © cannot be analytically computed but the limiting 
behavior of for large A?’ is again Eq. ([3]) (see Supplementary information). In particular, this distribution is not broad, 
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A r (km) 


FIG. 6: Distribution of displacements (Left) . We show (blue circle) the aggregated empirical distribution P( Ar) for 
all the « 780, 000 cars in our dataset. The orange dashed line represents a fit with a truncated power law of the form 
P{ Ar) oc (Ar + Aro) _,s exp(—Ar/fc) with /? = 1.67, k = 84.1 km, Aro = 0.63 km. The coefficients of the fit are close to 
those found for mobile phone data in the US [2]. The yellow dots represents the fit with the stretched exponential Eq. 

(C = 0.63km °) associated to uncorrelated accelerations model. The red solid line shows the prediction based on our model 

Eq. § with i) vo = 17.9 km/h estimated as the intercept of the fit in Fig. [ 3 ] ii) p' = 1.06 jumps/h, and 5v = 20.9 km/h 
estimated from the fit of P(v\t) in figure [ 5 ] iii) (t) = 0.30 li coming from the average of the travel-times for all selected trips 
(see Methods). This prediction is of remarkable quality that can reasonably compared with the commonly used direct fit with 
a truncated power law with 3 free parameters. (Right) Visual representation of the integral in Eq. ©• The P(Ar) (blue line) 
is obtained as the superimposition of Poisson distributions of Eq. |5| with different A = p't. We display the distribution for 
t = 5,10,15,..., 180 minutes with a colormap going from red to yellow. The domain of these curves is limited by velocities 
between vo and 130 km/h. The area under each curve is decreasing with time as oc e _i/,J . 


in clear contrast with Levy flights which have divergent moments and are governed by large fluctuations. Since the 
distribution P(Ar) is not a fat tail distribution, all the phenomena associated to Levy flights, such as super-diffusion 
for example, are not expected for randomly accelerated walks. In Fig. [6] left, we show that our prediction on the shape 
of P(Ar) is consistent with the empirical data. Similar results can be obtained studying the mobility of single cities 
(see Fig. [S6|. We stress that the curve we propose is not an a posteriori best fit of the empirical P(Ar) but the curve 
predicted from Eq. § knowing the average travel-time t and using the values Vq, p', Sv' used in the description of 
the empirical P(v\t). 

The fact that we can fit the data by different forms, as the (truncated) power law fit proposed in various studies [UiJ. 
(see Table]!]) or the stretched exponential of Eq. (]3| , is an illustration of the difficulty to extract mechanistic information 
from empirical data only using macroscopic statistical laws, without taking into account the dynamical properties of 
the underlying processes. In particular, urban mobility seems thus not to be a Levy process as inferred from truncated 
power law fits, but its random nature is governed by transitions between modes or roads with different velocities. In 
contrast with purely empirical works that led to Levy flight models, our model is based on the interaction between 
individuals and the transportation networks [5], Our approach accounts for the displacement distributions, but also 
provide a possible explanation of the processes generating such macroscopic patterns. 








Methods 


Private Transportation Data 

We compute spatial displacements and travel times for private transportation from a database of GPS measures 
describing the trajectories of private vehicles in the whole Italy during the month of May 2011. This database is 
mainly set up by private vehicles, since Taxi or delivery companies use their own GPS systems and do not contribute 
to the database. A small percentage of vehicles belongs to private companies and are used for professional reasons. 
This database includes ~ 2% of the vehicles registered in Italy, containing a total of 128,363,000 trips performed 
by 779, 000 vehicles. Records contain information about engine starts and stops, and travel-times also include the 
eventual time spent looking for parking. When the quality of the satellite signal is good, we have an average spatial 
accuracy of the order of 10 meters, but in some cases it can reach values up to 30 meters or more M- The temporal 
resolution is of the order of the second. 

We have applied correction and filtering procedures to exclude from our analysis the data affected by systematic 
errors. Approximately 10% of the data were discarded for this reason. When the engine is switched on or when the 
vehicle is parked inside a building, there are some errors due to the signal loss. In such cases, we use the redundant 
information given by the previous stopping point to correct 20% of the data When the engine’s was off for less than 
30 seconds, the subsequent trajectory is considered as a continuation of the same trip if it is not going back towards 
the origin of the first trajectory. 

For privacy reasons, the drivers’ city of residence is unknown. Therefore, it has been necessary to associate each 
car to an urban area using the available information. We do that identifying a driver as living in a certain city if 
the most part of its parking time was spent in the corresponding municipality area. Then, for each driver we have 
considered all the mobility performed, both within and outside the urban area. 


Details of the model 

The model proposed here assumes that urban mobility is performed at an (euclidean) speed of vq ss 18 km/h. Very 
short trips hardly reach the base speed vq we impose in our model. For this reason, in all the measures proposed in 
this paper, we considered only trips longer than 1 km and 5 minutes. 

The upper bound to 130 km/h limits the number of jumps to k = (130km/h — vq )/Sv ~ 3. We a jumping rate 
of order p' « 1 jump/h, we may expect significant deviations from our prediction for trips longer than 4 hours. For 
long trips one cannot in principle approximate the area below the step function in Fig. [4] with a triangle. Indeed, the 
Poisson fit in Fig(5] overestimates the frequency of very fast trips for t = 120 and 180 minutes. If the acceleration 
kicks are limited by a finite number of layers, the speed grows linear for small times and converge asymptotically to 
a limiting speed (see Supplementary Information). 
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Supplementary information 


Trip duration in public and private transportation 




FIG. SI: Travel-times distributions in different cities and modes of transport. (Left) Travel-times extracted from a 
5% sample of all Oyster card journeys performed in a week during November 2009 on bus, Tube, DLR and London Overground. 
Data available at https://api-portal.tfl.gov.uk/docs (accessed 3/9/2015). The probability distribution is fitted with the curve 
P(t) oc (1 — exp(— t/c)) exp(—cexp(— t/c)/b — t/b) proposed in [7] for the description of daily travel time expenditures. The 
parameters c = 36 ± 2 minutes and b = 9.3 ± 0.3 minutes show that the average travel-time given by the exponential tail is 
of order of 9 minutes, but at least 36 minutes are needed for reaching this exponential behavior. (Right) In contrast with 
Fig. 1, where the travel-times of private cars were normalized, here we display the differences between the distributions for the 
6 largest cities in Italy. The tail appears to be exponential for travel-times shorter than the classical value of daily travel-time 
budget of 1 hour |7J [54]. 



FIG. S2: Variation of the average travel-time of private cars across Italian cities. We display here the values for all 
cities with at last 500 drivers that are monitored. The average travel-time (computed here over all trips in a given city) is in 
the approximate range [9,18] minutes. These values are correlated r = 0.40 with the city population. The solid line represents 
a fit for a power law t oc population 7 with 7 = 0.07 ± 0.02. 
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Dependence of velocities over time in public transportation. 



FIG. S3: Acceleration of public transportation. (Left) We observe also for public transportation trajectories (linking 
all possible origins and destinations in Great Britain) a uniform acceleration for trips of duration between 30 minutes and 3.5 
hours. The solid red line represents the fit with Vo = 7.4 ± 0.4 km/h and a = 12.0 ± 0.2 km/h 2 . In this case, very short trips 
appear as faster thanks to the likely absence of connections. (Right) Conditioned speed distribution for trajectories originated 
in Charing Cross in London. Selecting trips of duration shorter than 1 hour we have manifestly a normal distribution. 


We estimate velocities for public transportation trips from an open dataset providing a complete snapshot of the 
multilayer temporal network of public transport in Great Britain in October 2010 I oil . This dataset assumes a waiting 
time before a flight of 2 hours, a waiting time after a flight of 30 minutes and, where not explicitly defined by the 
transportation agencies, a walking distance of 250 m. The starting time in our analysis of a week is Monday, 8am. 
Starting from that time, we consider for all trips from a node i to a node j, the time spent in transportation after 
departing from the node i in any direction. The shortest time-respecting path [30| is then identified with a Dijskstra 
algorithm, where also the time spent walking (at 5 km/h) between two adjacent stops of any modes of transport and 
the time spent waiting at the connection is integrated to the total travel-time. The euclidean distance between origin 
and destination is then used for estimating velocities. 

Both the averages and the distribution of Fig. S3 are not computed on real flows, which are not available with the 
same spatial extension and definition of the dataset. In this study, we implicitly assumed a uniform travel demand 
by including all possible origin-destination pairs. 


We observe a similar linear growth for the average velocities of trajectories in the public transport system (Fig S3 


left). In this case we do not have actual individual trajectories but if we assume that there is a uniform travel demand 
and that travellers make the shortest time-respecting path between origin and destination ED] , we can estimate 
velocities. Short trips, of less than 30 minutes, tend to be faster thanks to the likely absence of time-consuming 
connections. For t € [0.5, 3.5] hours, the growth is again linear but both base speed v 0 = 7.4 ± 0.4 km/h and 
acceleration a = 12.0 ± 0.2 km/h 2 are smaller than in the case of private transport. In Fig S3 right we show that for 
urban trajectories in London (t < 1 h from Charing Cross) the distribution P(v) is universal and gaussian-like. The 
difference between the exponential shape of P(v) for private transport and the gaussian-like distribution for public 
transport might originate in the fact that in public transport the average speed is the weighted average of velocities 
associated to each edge of the transportation network (plus the effect of waiting time), leading to a normal distribution 
according to the central limit theorem [53] . In principle, public transports move according to a deterministic dynamics 
and arrival times at stations is fixed. However, fluctuations in the average velocity of a trip lead to small delays at 
different stations and when summed together -for a trip with connections - lead to a Gaussian distribution. 
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FIG. S4: Average speed in a one-dimensional multilayer hierarchical transportation infrastructure. (Left) The 

dotted line in the plane (9, x) represents a trajectory where a time t* is spent on the base layer with speed Vo (red dots) and 
a time 9 — t* is spent travelling at speed v\ on the fast layer (green dashed line). The average speed v is simply the weighted 
average of the two speeds (blue solid line). (Right) A numerical simulation confirms the result of Eq. (black dashed line) for 
the case of two layers. We also simulated the case of more than two layers where the linear growth regime v ~ Vo + \p(v i — Vo) 
(blue solid line) extends beyond pt <C 1. 


Accelerated walker with a limited number of layers 

We consider the one dimensional case (the extension to 2d is simple) for a transportation network with only 2 
possible layers: Lq and L i corresponding to different travel velocities. On the layer Lq, individuals travel with speed 
vq, while on L\ they are traveling faster at speed V\ = Vq + dv. An individual starts its trip of duration t in Lq and has 
a probability per unit time p to jump to layer L\ and to increase its speed. Being a Poisson process, the probability 
to jump at time t* is then given by 


P(t* =9)= pe~ 


(7) 


and the position at time t6 of the traveller is 

x{6) = v 0 6H(t* -9) + H{6 - t*) [v 0 t* + - t*)] 

where H(x) is the Heaviside function (see Fig. [S4| left). 

By averaging the position over t*, we obtain for the average speed v — x/t the following simple expression 


( 8 ) 


>(t) =vi + (v 0 - t>i) 


1 - e~ pt 
pt 


(9) 


(the bar 7 denotes the average over different trips). For the limiting case pt <C 1 the average speed grows linearly from 
the base value Vo'- v(pt -C 1) ~ vq + \pt{y\ — Vo), while for pt 3> 1 the average speed converges asymptotically to v-\: 
v(pt 1) ~ v\ — ( v\ — vo)/pt. This simple model thus recovers, in some regime, the linear growth of speed with the 
duration of the trip, and also the tendency to reach a limiting speed observed in Fig [3] 

If, as in the model we propose in this paper, we have more than two layers with the same jumping probability and 
speed gap v n +± — v n = Sv between consecutive layers L n and L n+ 1 , the constant acceleration regime is extended up 
to pt > 1 (see Fig. S4 right). This result suggests that a multilayer hierarchical transportation infrastructure can 


explain the constant acceleration observed in both public and private transportation. From this model, we can also 
estimate the base speed Vq with the value of the intercept in the Figures [3j [S3] left, and |S5[ while the acceleration a 
is expected to be proportional to the probability of jump to faster layers p and to the gap between layers Sv. 
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FIG. S5: Speed growth of cars in the 6 largest Italian cities. The speed growth for single cities is less uniform than in 
the aggregate case of Fig. [3] This is probably related to inhomogeneities of the road network and geographical characteristics 
of the region around each city. All curves share a similar value of v = 16.5 ± 0.5 km/h for t — 5 min, therefore this constant 
value has been selected as Vo for the prediction proposed in Fig. |S6| 


Variability of P( Ar) among cities 


We note that for single cities there are of course local differences and deviations from the straight line that are 
probably due to the in-homogeneity of structure of the road network, but as can be seen in Fig. [S5] the linear trend 
is a good fit. Moreover, All cities seem to have a similar average speed of « 16.5 km/h at t = 5 minutes. We use this 
value as vq and estimate, for each of the 6 cities, p' and Sv' by fitting the conditional probability P(v\t) between 
and 130 km/h. Finally, we also compute the average travel-time t for each city This allows us to predict the exact 
shape of P(Ar) for trajectories of drivers living in different cities. In Fig. S6 we show that this prediction (red solid 
line) is correct, and can be compared with the a-posteriori fit with a Truncated Power Law (orange dash line) or the 
Stretched Exponential, Eq. ^ (yellow dots). 
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FIG. S6: Displacement distribution for the 6 largest Italian cities. Similarly to Fig.[6]left, we propose here the prediction 
of our model for the distribution P(Ar) in the six largest Italian cities. The blue circles represent the empirical distribution. 
The orange dash line represents the fit with a Truncated Power Law [2] and the yellow dots with a Stretched Exponential 
(Eq. |3|. The red solid line the prediction Eq. (jb|. The fit values for the cities are in the ranges p' = [0.97,1.81] jumps//i, 
Sv' = [13.2, 18.5] km/h, f = [0.30,0.37] h while Vo = 16.5 km/h. 
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Saddle-point analysis of P(Ar). 


The expression for P(Ar) can be rewritten as 


( 10 ) 


where 


F(Ar, t) 



r -j^\og{p't) + logT 



r - vpt \ 
Sv't ) 


( 11 ) 


At large r and t this integral is governed by the saddle point defined by dF/dt = 0 which reads 




r 


r - v 0 t 
t 2 


Sv't 2 


$ 1 + 


r - Vpt 
Sv't 


( 12 ) 


where ’h is the Digamma function. We then obtain for large times, the scaling r ~ t 2 , which implies the following 
behavior 


P(Ar) - e- Arl/2 (13) 

We note that this behavior will also be recovered even if we have Gaussian velocity distribution as it is observed for 
public transportation. 



